
Title : System and Method for Reduced Codebook Vector Quantization. 
FIELD OF THE INVENTION 

5 The present invention relates generally to the field of signal processing, more 

particularly to the field of vector quantization as applied to lossy signal compression. 

BACKGROUND OF THE INVENTION 

10 Although the present invention may be directed toward any number of signal 

compression areas, to aid the reader in understanding the present invention, we refer 
to a single area by way of example. This example being the compression of data that 
adds texture to a digital image. 

15 Conventional graphics systems such those as found in personal computers and 

home video game computers, use a frame buffer to store all the graphic data 
information that will be displayed on the computer screen. A graphics engine must 
"render" or draw graphics information into the frame buffer. Textures such as bumps, 
scratches, and surface features were not modeled by early graphics rendering engines. 

20 Rather, extremely smooth surfaces were constructed over a framework of graphics 
primitives such as polygons and vectors. Current graphics engines map textures onto 
these surfaces to replace artificially smooth surfaces with realistic detail. Examples of 
a texture of an object include the grass on a lawn, or the skin-tone variations and 
wrinkles on a human face. 

25 

A texture map is comprised of texels (texture elements) that are stored in 
texture memory. Texture memory is a scarce resource, so in order to efficiently use it, 
the digital signal representing the texture map is often compressed with a fixed 
compression ratio. 

30 
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U.S. patent 5,822,452 discloses a method and system for "compressing and 
decompressing a texture image". This method and various obvious improvements and 
modifications have been widely studied and adopted. The method is the following: a 
compression color space is selected either manually or using a neural network, each 
texel in the texture image is converted to an 8-bit value in the selected color space, 
and a decompression table is generated that represents the RGB values for each texel 
stored in the selected color space. When rendering a pixel representing an object with 
a texture, the texture image is mapped to the representation of the object, and one or 
more texels that are associated with each pixel are decompressed. 

The inventors in U.S. patent 5,822,452 go to great lengths to describe their 
neural network algorithm for selecting the compression color space. In fact, this 
method is an ad hoc, heuristic, and sub-optimal example of a gradient descent method. 
Neural networks are frequently found to produce performance that may be superior to 
random guessing for poorly characterized and/or mathematically intractable 
optimization problems. The neural network as disclosed operates by iteratively 
modifying the choice of color space such that for each individual texel value, in turn, 
the distortion is lowered. However, lowering the distortion for a particular input may 
raise the distortion for the rest of the inputs leading to a net overall increase in 
distortion. In practice, more often than not, by using ad hoc techniques such as these, 
supplemented with user intervention to tune various optimization parameters, 
acceptable performance may be realized, but with much greater effort, both 
computational and human, than what may be possible with a principled approach. 

Although the above referenced prior art patent deals specifically with the 
compression of color spaces, there is a more general need for a simple improved 
method of optimally compressing digital signals. In other words, an improvement to 
manual, neural network, and other ad hoc approaches. The present invention 
addresses this need. 



SUMMARY OF THE INVENTION 

The present invention relates to a system and method for utilizing a modified 
LCVQ algorithm to produce optimal codebooks for use in minimizing the content 
required in a digitized data stream. 

One aspect of the present invention is a modified LCVQ method for creating 
an optimal codebook; the method having the steps of: 

a) initializing a base codebook B with a set of base codevectors; 

b) finding the nearest neighbour for each source vector in a block of data and 
associating each source vector with a base codevector in the base codebook B; 

c) creating codebook C based upon the formula, C =B*W; 

d) updating the base codebook B utilizing the following formula: 
B = SW T * inverse(WNW T ); and 

e) repeating steps b) to d) until the base codevectors in base codebook B have 



In another aspect of the present invention, there is provided a computer system 
for creating optimal codebooks, the system having: 



converged. 



a) 
b) 



a general purpose computer; 

memory connected to said general purpose computer; 



c) a compression module resident in memory b); where the compression module 
comprises; 

i) an initialization module for calculating initial values for a base 
codebook B, the initialization values being source vectors from a block of data; 

ii) a nearest neighbour module, the nearest neighbour module accepting 
base codebook B as input from the initialization module and assigning each source 
vector in the data block to a vector in base codebook B; and calculating a codebook C 
based upon the values in base codebook B; 

iii) a centroid module, the centroid module recalculating the values 
contained in base codebook B to improve convergence; 

iv) a convergence module, said convergence module determining if the 
centroid module has converged the values in base codebook B; and 

d) a graphics engine connected to the compression module. 

In another aspect of the present invention there is provided a system for 
creating an optimal codebook; the system having: 

a) means for initializing a base codebook B; 

b) means for creating a codebook C from base codebook B; 

c) means for recalculating base codebook B; and 

d) means for determining if the contents of base codebook B have converged. 

In yet another aspect of the present invention, there is provided a computer 
readable medium containing instructions for creating an optimal codebook; the 
instructions having the steps of: 

a) initializing a base codebook B; 

b) finding the nearest neighbour for each source vector in a block of data and 
associating each source vector with a codevector in base codebook B; 

c) creating a codebook C based upon the formula, C =B*W; 



d) updating base codebook B utilizing the following formula: 
B = SW T * inverse(WNW T ); and 

e) repeating steps b) to d) until the values in base codebook B have 
converged. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a better understanding of the present invention, and to show more clearly 
10 how it may be carried into effect, reference will now be made, by way of example, to 
the accompanying drawings which aid in understanding an embodiment of the present 
invention and in which: 

Figure 1 is a plan view of a set of source vectors and codevectors in a two 
1 5 dimensional space; 

Figure 2a is a block diagram of an encoder; 
Figure 2b is a block diagram of a decoder; 

Figure 3 is a block diagram of a computer system in which the present 
invention operates; 

20 Figure 4 is a flow chart illustrating the logical components of the present 

invention; and 

Figure 5 is a block diagram of the components of the compression module. 

25 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is directed toward a system and method of compressing 
image signals. Vector quantization is commonly used to compress image signals. 
Vector quantization takes as input a set of source vectors and calculates a smaller set 
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of codevectors. These codevectors are then stored in a codebook. To better illustrate 
this concept, we refer now to Figure 1. 

Figure 1 is a plan view of a set of source vectors and codevectors in a two 
5 dimensional space shown generally as 10. Source vectors 12 are represented by an 
"x" and codevectors 14 are represented by a solid block in Figure 1. The two 
dimensional space 10 has been divided into a number of Voronoi regions 16, by a 
process well known in the art. Each region 16 will have a single codevector 14 which 
is stored in a codebook (Figure 2). Codevector 14 is the nearest neighbour to the 

10 source vectors 12 contained within a region 16. The present invention is directed 
toward the determination of a set of optimal codevectors 14. For the purposes of this 
disclosure, space 10 is considered to be an 8x8 block of source vectors representing 
pixels or texels. As such, one would expect space 10 to contain 64 source vectors. As 
can be seen from Figure 1 this is not the case as including 64 source vectors would 

15 have made Figure 1 somewhat "busy". Further, it is not the intent of the inventors to 
absolutely require space 10 to be 8x8. In the preferred embodiment it is 8x8, 
however, it may be any dimension deemed by the implementer to be the most 
efficient size for the implementers use of the present invention. 

20 Figure 2a is a block diagram of an encoder shown generally as 20. Encoder 20 

comprises search engine 22 and codebook 26. Codebook 26 comprises a plurality of 
codevectors 14. Search engine 22 accepts as input source vector 12 and searches for 
the codevector 14 that is the nearest neighbour to source vector 12 in codebook 26. 
Link 24 illustrates a link to the nearest neighbour vector located by search engine 22, 

25 i.e. a codevector 14. Each codevector 14 in codebook 26 has a corresponding index 
28. Once search engine 22 has located the appropriate codevector 14, encoder 20 
outputs via communication channel 30 the index 28 for the codevector 14. Only the 
index is passed, not the entire vector, this reduces bandwidth on channel 30. 
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Referring now to Figure 2b, a diagram of a decoder is shown generally as 40. 
Decoder 40 takes as input an index 28 passed on channel 30 by encoder 20. Index 28 
is then used to locate the corresponding codevector 14 in codebook 26. Codevector 
14 is output via link 42 as output vector 44. 

5 

We will now discuss in general the concepts of compression and the specific 
method used by the present invention. 

Effectiveness of any lossy compression technique can be measured in terms of 
10 the distortion introduced in order to obtain a required compression ratio. The 
distortion can be expressed in terms of the square difference between corresponding 
original and decompressed pixels averaged over the image (i.e., mean squared error 
for which less is better). The compression ratio (more is better) is the number of bits 
in the input divided by the number of bits in the compressed code that is the output of 
15 the compression apparatus. Ideally compression would be achieved with zero 
distortion (no loss). Lossless compression techniques introducing no distortion do 
exist. For typical inputs these methods will achieve modest compression ratios on 
average. However, for any lossless technique it is possible to find an input that will 
result in zero-compression, or even expansion of the signal. It is well known in the art 
20 that a truly random input is not compressible without loss. For this reason, when a 
fixed or minimum compression ratio must be consistently achieved, only lossy 
techniques are applicable. 

A conventional example of lossy compression is to replace each 8-bits of each 
25 color component of a vector value that was in a red-green-blue (RGB) color space 
(RGB888) with a number of its most significant bits (MSBs) in a reduced color space 
(e.g., RGB565). That is, a 24-bit value is compressed in a 3:2 ratio by replacement 
with a 16-bit value, by reducing the number of bits to 5 -bits of red, 6-bits of green, 
and 5-bits of blue. To achieve a 3:1 compression ratio RGB332 could, theoretically, 
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be used, but practically the decompressed image quality is much too poor when the 
data is reduced in this straightforward manner. 

Recent methods achieve acceptable visual quality with high compression 
5 ratios (e.g, 6:1). One broadly used technique is a linearly-constrained vector 
quantization (LCVQ) representation of blocks of vectors. 

In a vector quantizer, lossy compression is introduced by representing vectors 
with a reduced number of codevectors that introduce only an almost imperceptible 

10 distortion when replacing the original vectors. That is, each vector in a block of T 
vectors (e.g., T = 16 vectors in a 4x4 block from the texture image) is represented by 
M possible Y-bit codevectors (e.g., M = 4, Y = 2) such that a 2-bits look-up table 
index is required to specify which codevector is used to represent a given vector. 
Linearly-constrained VQ places the restriction on these M codevectors that they must 

15 be formed from a simple linear combination of a much smaller set of L base 
codevectors (e.g., L = 2). For example, a codevector CI may be constructed from two 
base codevectors through a linear construction such as: 

CI = 0.5*B1 +0.3*B2 + 0.2*B3 

20 

The above referenced linear equation is illustrative only, as the dimension of base 
codevectors may be greater than three and the weights for each dimension of B will 
vary. 

25 This restriction is motivated by the fact that the complete compressed 

representation of each block of T vectors requires two sets of information: a block 
specific set of base codevectors, and T individual indices. Each of the T vectors in a 
block is represented by a Y-bit index specifying their representative codevector from 
the matrix C. Since the L base codevectors that form the base codevector matrix B 

30 are also unique for each block of T vectors, these base codevectors also form part of 
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the compressed representation of the block. This flexibility allows the codevectors to 
adapt to colors present in each unique image block. For example, if L=2 and the base 
codevectors are stored in RGB565 format then each block requires two 16-bit base 
codevectors and 16 2-bit indices, for a total of 64-bits (4-bits per vector). This 
5 compression ratio is therefore twice that of the method described in U.S. patent 
5,822,452, which requires 8-bits per vector. 

We will now describe a principled, mathematical description of an LCVQ 
representation of a block of a texture image. Let X be a D row by T column matrix. 

10 For the example of texel compression, D=3 is the number of components in each 
texel/vector, thus D represents the three values of RGB for each texel/vector. The 
value T is the number of vectors in an image block I, each vector in that block forms 
one column of X. Let C be a D row by M column matrix containing the current 
codevectors for block I. Let N be a M by M diagonal matrix with the number of 

15 vectors in block I which are closest to each codevector (i.e. produce the least mse 
distortion and therefore assigned to that codevector) in matrix C appearing along the 
diagonal in the corresponding column of matrix N. Let S be a D by M matrix where 
each column contains the sum of the vectors currently assigned to each codevector. 
Let B be a D by L matrix where each column contains the current base codevectors 

20 for block I, such that OBW is found by matrix multiplication with a fixed L by M 
weight matrix that specifies what proportion of each base codevector is used to form 
the current codevectors that represent the vectors. 

For an example of present art, the mixed LCVQ format of S3 Inc., of Santa 
25 Clara, California, uses T=l 6=4x4, L=2, M=4, and RGB=565 for the base codevectors. 
This is one of the most widely accepted formats used for texture compression for 3D 
graphics. Each subblock of 4x4 pixels is coded with a four entry codebook (2 bits per 
pixel), where two of the entries are derived by linear interpolation from the two base 
codevectors that are quantized in 565 format. The red, green, and blue (RGB) 
30 channels are quantized to five, six, and five bits respectively. 
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As a further example of present art, the high LCVQ format utilized by 3dfx 
Interactive Inc., of Alviso, California, uses T=32=4x8,L=2,M=8, and RGB=565 for 
the base codevectors. Each subblock of 4x8 pixels is coded with eight codevectors (3 
bits) derived by linear interpolation from two base codevectors that are quantized in 
565 format. 

Finally, the chroma LCVQ format of 3dfx uses T=32=4x8, L=M=4, and 
RGB=565 for the base codevectors. Each subblock of 4x8 pixels is coded with four 
codevectors (2 bits) that are stored in 565 format. This accommodates complex color 
regions; however, since no codevectors are derived, it is a block VQ rather than block 
LCVQ format (the weight matrix is the identity matrix). 

The problem with these 3 examples of prior art is that: 

1) For blocks with very complex color (having more than 8 visibly distinct colors) 
M is too small. In particular, steep color gradients arising from the gradual linear 
blending of one color with another quite distinct color are not rendered as well as 
would be possible with a larger M. 

2) To obtain the flexibility of M=8 (a larger number of distinct colors in a 
texture/image block), it is necessary that all colors in a block be obtainable from 
linear interpolation of two base colors. While this model works well for color 
gradients, complex color textures rarely follow this model. That is, in the prior 
art the derived colors (codevectors) are constrained to lie along a line in the 3D 
(RGB) color space. We would like 2D and 3D linearly constrained models that 
allow the derived colors to either lie along a plane in the color space, or to occupy 
the 3D volume of the colorspace respectively. 
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Thus, the present invention is directed toward finding an improved system for 
finding a superior base codebook B for each block of vectors of a texture image. The 
present invention makes use of a general algorithm that is computationally scalable, 
and that guarantees convergence towards a minimum distortion compressed 
representation for the wide class of signal compression techniques that use "linearly- 
constrained vector quantization" (LCVQ). 

Given this framework, the generalized Lloyd algorithm (GLA/LBG) may be 
fundamentally modified and extended to a new algorithm that has similar properties to 
what the original GLA/LBG has for unconstrained vector quantization. The new 
linearly-constrained generalized Lloyd algorithm (LCGLA) has two steps: 

1) Find the nearest neighbour: each vector is assigned to its nearest codevector in C. 

2) Centroid: the base codevector matrix B is updated with the following formulae: 
B =SW T * inverse(WNW T ) 

C=BW 

Furthermore, by limiting the number of iterations of these two steps that are 
performed, it is possible to limit the computational complexity of the algorithm to a 
fixed quantity - making it suitable for hardware implementation, (i.e. the method is 
computationally scalable) 

The present invention works with both two-dimensional and three-dimensional 
LCVQ codebooks. 

The present invention provides LCVQ formats that are more effective than 
those used in the present art in which linear interpolation of base codevectors to form 
the codebook C is the sole technique. Also, a larger block size of 8x8 is proposed 
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rather than a block size of 4x4 or 4x8 such that more distinct colors are possible 
within a block. Three such formats are the following: 

1) ID LCVQ (T=64=8x8,L-2,M=ll,RGB=565). A single block of 8x8 pixels 
uses less than 3.5 bits per pixel for choosing the color of each pixel, and .5 bit per 
pixel for representing the base codevectors. This format is ideal for high quality 
representation of steep color gradients, as commonly results in obvious artifacts 
representing such textures as "sky" and "water" with the prior art. A steep color 
gradient would be a gradual change from [0,0,0] to [255,255,255] within one 8x8 
block. For example, white at the left side of the block, black at the right side of the 
block and different levels of gray in between. Another example would be a gradual 
change from red [255,0,0] to blue [0,0,255] or from any two colors that are widely 
separated in their [r,g,b] representation. 

A block of 8x8 pixels would use a weigh matrix of two rows namely: 

Row 1)1, (k-l)/k, (k-2)/k, ... 0 
Row 2)0, 1/k, 2/k, ... 0 

where k is typically 3 or 7 and the number of columns in the weight matrix W is k+1 
such that there are 2 base vectors in B and (k+1) derived vectors in codebook C. 

2) 2D (T=64=8x8,L=3 5 M=8,RGB=777). A block of 8x8 pixels uses the weight 



matrix 



1 0 .25 -.25 1 -1 0 -1 



W = 



0 1 .25 -.25 -1 1 -1 0 



0 0 .5 



1.5 1 1 2 2 
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With this example, colors are no-longer constrained to lie along a line between two 
base colors, but are constrained to lie in the plane defined by three base-colors. An 
obvious example when this format will outperform prior art is a color gradient in 
which three colors are mixed. 

3) 3D (T=64=8x8, L=4,M=8,RGB=565): A block of 8x8 pixels uses the weight 
matrix 



W = 



8 0 0 1 1 3 4 
0 8 0 3 4 4 1 /8 
0 0 8 4 3 1 3 



With this format, colors in the derived codebook C fill the entire volume of RGB 
colorspace. An obvious example when this sort of flexibility is required is a highly 
colorful texture such as small, colorful beads. The S3 Inc. mixed format also has this 
property that colors are not constrained to a plane or line in the 3D colorspace, but to 
obtain this property it limits the number of colors in a block to M=4. By using this 3D 
LCVQ format we obtain this same property while allowing M=8, eight distinct colors 
in each block. 



The only restriction on a weight matrix W is that it be non-singular. In other 
words, there are many different possible matrices that may be used instead of the ones 
suggest above by the inventor. For example, one efficient method of utilizing the 
present invention may be to pick a set of different weight matrices W that provide 
good distortion performance for representing certain types of images and allow one of 
the set to be used for each block, as specified on a block by block basis. 
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In summary, when quantizing a texture image, the image is partitioned into 
non-overlapping blocks of 8x8 vectors/pixels. These blocks are compressed, 
communicated, and decompressed in conventional raster-scan top-left to bottom-right 
order. Within each 8x8 block, the vector/pixels themselves are al§o represented in 

5 conventional raster-scan top-left to bottom-right order by an index which indicates 
what codevector is used to represent each vector. The base codevectors (B) 
themselves are present in the compressed representation of each block and are 
transmitted via channel 28 (see Figures 1 and 2). The matrix W is stored in both the 
encoder 20 and decoder 40. Thus, through the use of fixed matrix W and dynamic 

10 matrix B, the codevectors in codebook 26 (i.e. matrix C) are dynamically derived by 
matrix multiplication. By transmitting B, bandwidth is saved by not requiring the 
transmission of C. In addition, as discussed earlier, a number of matrices W may be 
employed by both the encoder and the decoder and an optimal matrix W selected for 
each individual block. 

15 

Referring now to Figure 3 a block diagram of a computer system in which the 
present invention operates is shown generally as 100. In the preferred embodiment, 
the computer system 100 is a conventional personal computer such as an IBM 
compatible PC with a conventional DirectX 5.0 (or later) compatible graphics engine 
20 114, and a non-conventional compression module 122 stored in conventional random 
access memory 104. 

DirectX is a suite of multimedia application programming interfaces provided 
by Microsoft Corporation. DirectX provides a standard development platform for 
25 Windows-based PCs by enabling software developers to access specialized hardware 
features without having to write hardware-specific code. 

Processor 102 of computer system 100 is a processor, capable of running the 
DirectX functions, such as a Pentium series processor commercially available from 
30 Intel Corp. Processor/memory bus 106 and I/O (input/output) bus 110 are 
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conventional. A conventional I/O bus controller 108 controls the data flow between 
I/O bus 110 and processor/memory bus 106. Conventional I/O devices 116, such as a 
keyboard and disk drive, are connected to I/O bus 110. A conventional computer 
monitor 1 12 is driven by the graphics engine unit 114. 

For an embodiment in which the graphics engine 114 is non-conventional, 
texture data is communicated across processor/memory Bus 106 5 I/O Bus Controller 
108 and I/O Bus 110 from compression module 122 to graphics engine 1 14 in at least 
one of the following non-conventional formats for each block of 8x8 pixels from the 
original texture image: 

1) ID: 

(1 1 codevectors are derived from the base codevectors) 
base codevector 1: 5-bits red, 6-bits green, 5-bits blue 
base codevector 2: 5-bits red, 6-bits green, 5-bits blue 

32 seven bit indices: each index (ranging from 1 to 121) indicates for two 

adjacent source vectors, which codevectors (between 1 and 11) they are 

represented by. The forward and backward mapping from a paired index to 

individual indices is given as: 

vector Aindex = floor[PairedIndex/l 1] 

vectorB index = mod (Pairdlndex-l, 11) + 1 

Pairedlndex = vectorAindex * vectorBindex 

In this way, the index for each individual source vector is represented in 3.5 bits. 
The fractional bit representation is made possible by pairing each vector with one 
other vector. In other words, with 64 source vectors there are 32 pairs of source 
vectors represented by 32 seven bit indices. 

2) 2D: atotalof255-bits 

(8 codevectors are derived from the base codevectors) 
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base codevector 1 : 7-bits red, 7-bits green, 7-bits blue 
base codevector 2: 7-bits red, 7-bits green, 7-bits blue 
base codevector 3: 7-bits red, 7-bits green, 7-bits blue 

64 3-bit indexes: each index indicates which codevector represents an individual 
vector. 

3) 3D: atotalof256-bits 

(8 codevectors are derived from the base codevectors) 
base codevector 1: 5-bits red, 6-bits green, 5-bits blue 
base codevector 2: 5-bits red, 6-bits green, 5-bits blue 
base codevector 3: 5-bits red, 6-bits green, 5-bits blue 
base codevector 4: 5-bits red, 6-bits green, 5-bits blue 

64 3 -bit indexes: each index indicates which codevector represents an individual 
vector. 

The pseudo code required to implement the preferred embodiment of 
compression module 122 is provided in Appendix 1 in Matlab format. The two 
algorithms, which may be written in any computing language, are used to encode each 
image block. The routine "Initcluster" is used to find a random initial base codebook 
B given an 8x8 block of 64 vectors that are the columns of the matrix X. The routine 
"lgbo". iteratively improves any codebook B using the unconventional LCGLA 
algorithm, which is a modified version of the convention generalized Lloyd algorithm 
that functions for linearly constrained codebooks C. 

Figure 4 is a flow chart of the logical components of the present invention, 
shown generally as 200. The following description provides an overview of the 
logical components of Figure 4. For specific implementation details, we refer the 
reader to Appendix 1. At step 202 the matrix B is randomly initialized without 
replacement from the 64 source vectors contained in the current 8x8 block of data 
(i.e. the space 10 of Figure 1). This initialization step is known in the art and many 
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others exist. It is the intent of the inventors that any form of initialization may be 
utilized when practicing the present invention. At step 204 the value of matrix C is 
set to: 

C = B * W; 

Matrix B is modified so that each vector in B is a "nearest neighbour" to the source 
vectors. This step locates the Voronoi regions of the codevectors. Then each of the 
64 source vectors are assigned to their nearest neighbour codevector in codebook C. 

Should the result of the creation of B provide a singular matrix, than B is 
reinitialized at step 206 to contain an outlying source vector, i.e. a vector that is far 
from what will ultimately be a code vector and control is returned to step 204. As 
one skilled in the art will recognize there are many methods to reinitialize matrix B so 
that it is not singular. The use of an outlier to reinitialize B is only a single example 
suggested by the inventors to resolve the problem of the matrix B being singular. 

At step 208 matrix B is recalculated so that C (which is derived from B) 
provides a minimum distortion of the 64 source vectors in the 8x8 block. This is done 
by updating B based upon the formula: 

B = SW T * inverse(WNW T ) 

At step 210 it is determined if the values in B have converged, i.e. B now 
contains the locally optimal set of base codevectors that are used to derive the 
codevectors in codebook C. If this is not the case, the process repeats by returning to 
step 204 . 

To further illustrate the invention, we refer now to Figure 5 a block diagram of 
the components of compression module 122 (see Figure 3). Module 122 performs the 
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functions illustrated in the flowchart of Figure 4. Initialize module 252 randomly 
initializes matrix B, based upon the source vectors contained in the current block of 
data (i.e. the space 10 of Figure 1). Nearest neighbour module 254 modifies matrix B 
so that each vector in B is a "nearest neighbour" to the source vectors and calculates 
5 matrix C. Centroid module 256 calculates the centroid of all the base vectors 
contained in B and corresponds to step 208 of Figure 4. Convergence module 258 
determines if the vectors in B have converged and if not, control is returned to nearest 
neighbour module 254. 

10 In an alternate embodiment, system 100 is a video game platform or a PC with 

a graphics card supporting alternative conventional industry standard LCVQ-based 
texture compression methods. 

In another alternate embodiment, the graphics engine 114 is non-conventional 
15 and supports 2D and 3D LCVQ formats. 

It is not the intent of the inventor to limit the present invention to vector quantization 
for images. As one skilled in the art of signal compression will appreciate, the present 
invention has use in any area of signal compression or pattern recognition where 
20 linearly-constrained nearest neighbour techniques are used. The use of the present 
invention for vector compression of texture images is meant to be illustrative of one 
use. As an example of a use other than compression of video images, the present 
invention may be utilized in pattern recognition and in particular optical character 
recognition. 

25 

While the present invention minimizes mean squared error for LCVQ, the 
extension of the algorithm to minimize weighted mean squared error and other such 
commonly practiced modifications of the error criterion as considered from VQ in the 
present art, are considered by the inventor to be extensions to the present invention 



# 



requiring only the transfer of VQ techniques to LCVQ as would be obvious to anyone 
skilled in the art. 

Although the invention has been described with reference to certain specific 
5 embodiments, various modifications thereof will be apparent to those skilled in the art 
without departing from the spirit and scope of the invention as outlined in the claims 
appended hereto. 
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